ABSTRACT

The Shipsystem Maintenance Monitoring and Support
Office (SMMSO) is establishing a new procedure whereby
data collected by a SMMS Site Team will be typed into
a DEC PDP-11/03 minicomputer at or near the data collection
site. A routine in the PDP-11 will check to see
that each number entered lies within the bounds established
for its data field. This report documents a
FORTRAN computer model, called CHECK, which has been
used to analyze existing data for each numerical field
and to suggest sets of bounds for use in the PDP-11.

Each data set examined by CHECK is tested for possible
adherence to assumptions of exponentiality or normality.
Bounds are suggested based on the results of these
tests. In addition, CHECK indicates the expected probability
that another data value from the same distribution
will lie within the suggested bounds.

ADMINISTRATIVE  INFORMATION 

This report is a result of work performed under Program Element
60000N, Task Area OMN, Work Unit 1870-003, SEC PO 70298 and Work Unit
1870-018, SEC PO 80420 for NAVSEC 6107E.


1.  INTRODUCTION 


1.1 BACKGROUND

The Shipsystem Maintenance Monitoring and Support Office (SMMSO) is
establishing a new procedure whereby data collected by a SMMS Site Team
(SST) will be typed into a DEC PDP-11/03 minicomputer (hereafter referred
to as PDP-11) at or near the data collection site. A routine in the PDP-11
will check to see that each number typed in lies within bounds established
for its data field. If any number does not lie within the bounds established,
the PDP-11 will alert the SST member entering the data and allow
him  to  reenter  the  number,  leave  the  field  blank  until  further  testing  can 
be  conducted,  or  force  the  PDP-11  to  accept  the  number  as  typed.  Since 
the  PDP-11  is  on-site  and  the  typing  is  to  be  done  in  the  same  time  frame 
as  the  testing,  many  errors  due  to  misreading  measurement  equipment, 
writing  incorrect  numbers  on  the  Maintenance  Requirement  Cards  (MRCs),  and 
forgetting  to  perform  tests  or  write  down  results  can  be  corrected.  The 
SST  will  often  be  able  to  return  to  the  ship  and  conduct  further  testing 
if  required.  In  addition,  many  keypunching  errors  will  be  avoided. 

1.2  PURPOSE 

A FORTRAN  computer  model  called  CHECK  has  been  used  on  the  CDC  6000 
computers  at  the  David  W.  Taylor  Naval  Ship  Research  and  Development 
Center  (DTNSRDC)  to  analyze  existing  data  for  each  numeric  field  on  the 
MRC's.  Output  of  this  model  includes  one  or  more  suggested  sets  of  bounds 
for  use  in  the  PDP-11.  The  results  of  using  the  model  on  the  MRC  numeric 
fields  are  available  from  SMMSO  (NAVSEC  6107E1).  This  report  provides 


2.  DESCRIPTION  OF  CHECK 

CHECK has six major parts: the data reader, the exponentiality
checker, the exponential bounds calculator, the normality checker, the
normal bounds calculator, and the range bounds calculator. Each part
involving calculation of bounds also includes calculation of the estimated
probability that some future measurement will also be found within
the calculated bounds. The subroutine WSTEST is used by the exponentiality
checker and the normality checker.

2.1  THE  DATA  READER 

The data reader is designed to read data in a free format. The first
thirty characters of each data record are read. Each character is then
compared in its turn to fourteen "admissible" characters consisting of
the ten digits, plus, minus, decimal point, and blank. The number is
constructed as the identity of the characters read is determined by the
comparisons. The comparisons end when the first of the following occurs:

o All thirty characters have been determined

o A blank is encountered after at least one digit has been found
(this indicates that the complete number has been read)

o An inadmissible character is encountered

o Ten consecutive minus signs and decimal points have been encountered
(this indicates the end of the data set).

Note that two or more numbers cannot be read from the same data record.
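The stopping rules above can be sketched in a few lines of Python. This is an illustrative reading of the rules, not a transcription of the FORTRAN in Appendix B; the function name `parse_record` and its return labels are hypothetical.

```python
DIGITS = "0123456789"

def parse_record(record):
    """Classify one data record (hypothetical helper).

    Returns ('value', x), ('blank', None), ('end', None) or ('error', None).
    """
    field = record[:30].ljust(30)      # only the first thirty columns count
    sign = 1.0
    int_part, frac_part = "", ""
    in_fraction = False
    run = 0                            # minus signs / decimal points since the last digit
    for ch in field:
        if ch == " ":
            if int_part or frac_part:  # blank after a digit: the number is complete
                break
        elif ch in DIGITS:
            run = 0
            if in_fraction:
                frac_part += ch
            else:
                int_part += ch
        elif ch in "-.":
            if ch == "-":
                sign = -1.0
            else:
                in_fraction = True
            run += 1
            if run >= 10:              # ten in a row marks the end of the data set
                return ("end", None)
        elif ch != "+":                # plus signs are admissible and ignored
            return ("error", None)
    if not (int_part or frac_part):
        # all blanks is a blank field; anything else without a digit is an error
        return ("blank", None) if field.strip() == "" else ("error", None)
    return ("value", sign * float((int_part or "0") + "." + (frac_part or "0")))
```

Because reading stops at the first blank after a digit, a record such as `7 8` yields only the value 7, which is why two numbers cannot share a record.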

2.2  THE  EXPONENTIALITY  CHECKER 

Epstein described twelve tests for the validity of the exponential
life assumption sometimes used in reliability tests. Two of these,
a large sample test and a uniformity test, have been adapted for use in
CHECK. A third test is used as a final selection criterion for inclusion
of exponential bounds on a summary file, TAPE2 (see subsection 3.3).


*A complete listing of references is given on page 49.


2.2.1  A Large  Sample  Test 

If $X_1, X_2, \ldots, X_n$ is a random sample from an exponential
distribution arranged in ascending order (i.e., $X_1 \le X_2 \le \cdots \le X_n$), then a
plot of $X_j$ vs $\ln(n+1-j)$ can be fitted well by a straight line. As an
indication of how well these points can be fitted with a straight line,
CHECK calculates (in WSTEST) and prints the coefficient of determination
of the least squares best linear fit to the data, as given by

\[ R^2 = \frac{\left[\sum_{i=1}^{n} (X_i-\bar{X})(Y_i-\bar{Y})\right]^2}{\sum_{i=1}^{n} (X_i-\bar{X})^2 \, \sum_{i=1}^{n} (Y_i-\bar{Y})^2} \tag{1} \]

where, for $i = 1, 2, \ldots, n$, $Y_i = \ln(n+1-i)$, and where $\bar{X}$ is the average of
the $X_i$'s and $\bar{Y}$ is the average of the $Y_i$'s. If $R^2$ is close to one (say,
above 0.975), the fit is considered to be good. As Epstein pointed out,
at least fifty data values are required for this test to be meaningful.
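As a minimal sketch of the large sample test, the following Python function computes the coefficient of determination between the ordered sample and $\ln(n+1-j)$. The function name is hypothetical, and the computation is the standard squared correlation rather than a transcription of WSTEST, which is not listed in this section.

```python
import math

def exponential_fit_r2(sample):
    """R^2 of the least squares line through (ln(n+1-j), X_j), j = 1..n."""
    xs = sorted(sample)                                   # ascending order
    n = len(xs)
    ys = [math.log(n + 1 - j) for j in range(1, n + 1)]
    x_bar = sum(xs) / n
    y_bar = sum(ys) / n
    sxy = sum((x - x_bar) * (y - y_bar) for x, y in zip(xs, ys))
    sxx = sum((x - x_bar) ** 2 for x in xs)
    syy = sum((y - y_bar) ** 2 for y in ys)
    return sxy * sxy / (sxx * syy)
```

A sample whose ordered values fall exactly on a line in $\ln(n+1-j)$ gives a value of one; equally spaced values give a markedly lower value.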


2.2.2  A Uniformity  Test 

If $X_1, X_2, \ldots, X_n$ is a random sample (unordered) from an exponential
distribution with probability density function (pdf)

\[ \frac{1}{B}\exp[-(x-C)/B], \qquad x \ge C \tag{2} \]

and if m is the largest integer no larger than n/2, then

\[ f = \frac{(n-m)\sum_{i=1}^{m}(X_i-C)}{m\sum_{i=m+1}^{n}(X_i-C)} \tag{3} \]

has an F distribution with 2m and 2(n-m) degrees of freedom. CHECK uses
the smallest of the $X_i$'s as an approximation to C, calculates f, and
uses the International Mathematical and Statistical Libraries (IMSL)³
routine MDFD to find the significance level associated with f.
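The statistic of Equation (3) is simple to compute directly. The sketch below returns f and its two degrees of freedom; it stops short of the significance level, since that step relies on the IMSL routine MDFD, which is not reproduced here. The function name is an assumption for illustration.

```python
def uniformity_f(sample):
    """Return (f, 2m, 2(n-m)) for Equation (3), using min(sample) in place of C."""
    n = len(sample)
    m = n // 2                                   # largest integer no larger than n/2
    c = min(sample)                              # approximation to the lower limit C
    first = sum(x - c for x in sample[:m])       # first m values, in the order given
    rest = sum(x - c for x in sample[m:])        # remaining n - m values
    if rest == 0:
        return 0.0, 2 * m, 2 * (n - m)           # degenerate case: avoid dividing by zero
    return (n - m) * first / (m * rest), 2 * m, 2 * (n - m)
```

For example, the sample 1, 2, 3, 4, 5 gives m = 2, sums of 1 and 9, and f = 3/18 with 4 and 6 degrees of freedom.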

2.2.3  A Two-Moment  Test 

The standard deviation of a random variable having an exponential
distribution with pdf given in Equation (2) is equal to the expected value
of that random variable less C (see Johnson and Kotz,⁴ Chapter 18). The
sample standard deviation of a random sample $X_1, X_2, \ldots, X_n$ is given by

\[ s = \left[\sum_{i=1}^{n} (X_i-\bar{X})^2/(n-1)\right]^{1/2} \tag{4} \]

If the sample mean minus the smallest sample value is less than half, or
more than twice, the sample standard deviation, then exponentiality of the
data must be doubted.
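The two-moment comparison can be sketched as follows. The function name and the `factor` parameter are illustrative assumptions; the default of 2 follows the wording of this paragraph, while paragraph 2.3.3 applies the stricter factor 1.8.

```python
import statistics

def exponentiality_doubted(sample, factor=2.0):
    """True when (mean - minimum) and s differ by more than the given factor."""
    s = statistics.stdev(sample)                      # Equation (4), n-1 divisor
    excess = statistics.mean(sample) - min(sample)    # sample mean less the estimate of C
    return excess < s / factor or excess > factor * s
```

Passing `factor=1.8` reproduces the tighter criterion used in the use decision.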

2.3  THE  EXPONENTIAL  BOUNDS  CALCULATOR 

If X is a random variable having an exponential distribution with
pdf as given in Equation (2), then X can never take on a value smaller
than C. Thus, C forms a natural lower bound for X. As will be seen in
the next paragraph, $C + t(\bar{X} - C)$ is a natural upper bound.


2.3.1  Lower  Limit  Known 

If $X_1, X_2, \ldots, X_n$ is a random sample from an exponential distribution
with pdf given in Equation (2), then each $(X_i - C)/B$ is known to
have a gamma distribution with parameters 1, 1, and 0 (see Johnson and
Kotz,⁴ Chapter 17, sections 1 and 5). Thus,

\[ Y = n(\bar{X}-C)/B \tag{5} \]

has a gamma distribution with parameters n, 1, and 0 (see Johnson and
Kotz,⁴ Chapter 17, Section 2). That is, the pdf of Y is

\[ f_Y(y) = \begin{cases} y^{n-1}e^{-y}/(n-1)! & \text{if } y \ge 0 \\ 0 & \text{if } y < 0 \end{cases} \tag{6} \]

If X is a random variable with pdf given by Equation (2), then

\[ \begin{aligned}
\mathrm{Prob}[X > C+t(\bar{X}-C)] &= \mathrm{Prob}[X > C+(tBY/n)] \\
&= \int_0^\infty \mathrm{Prob}[X > C+(tBy/n) \mid Y=y] \, dF_Y(y) \\
&= \int_0^\infty \exp(-[C+(tBy/n)-C]/B) \, y^{n-1}e^{-y} \, dy/(n-1)! \\
&= \int_0^\infty \exp(-y[1+(t/n)]) \, y^{n-1} \, dy/(n-1)! \\
&= \int_0^\infty z^{n-1}e^{-z} \, dz/\{(n-1)!\,[1+(t/n)]^{n}\} \\
&= [1+(t/n)]^{-n}
\end{aligned} \tag{7} \]

since Equation (6) is a pdf (and, therefore, integrates to one). From
Equation (7) it is seen that

\[ \mathrm{Prob}[X \le C+t(\bar{X}-C)] = 1 - [1+(t/n)]^{-n} \tag{8} \]

Note that once $\bar{X}$ has been calculated, the probability that another independent
draw from the exponential distribution is no larger than $C+t(\bar{X}-C)$
depends only on the value chosen for t. In particular, it is independent
of the parameter B.
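Equation (8) translates directly into code. The sketch below returns the bounds and the associated probability for a chosen t, and also scans t in steps of 0.1 for the smallest multiplier that reaches a target probability. The function names are hypothetical, and the scan range of 1.0 to 20.0 is an assumption taken from the loop limits in the Appendix B listing.

```python
def exponential_bounds(lower, mean, n, t):
    """Bounds [lower, lower + t*(mean - lower)] and the probability of Equation (8)."""
    upper = lower + t * (mean - lower)
    prob = 1.0 - (1.0 + t / n) ** (-n)
    return lower, upper, prob

def smallest_t(n, target=0.9775):
    """First t in 1.0, 1.1, ..., 20.0 whose Equation (8) probability reaches target."""
    for k in range(10, 201):
        t = 0.1 * k
        if 1.0 - (1.0 + t / n) ** (-n) >= target:
            return t
    return None                                  # no multiplier in range is large enough
```

For small samples the probability grows slowly with t: with n = 1 it is t/(1+t), so no multiplier up to 20 reaches 0.9775.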

2.3.2  Lower  Limit  Unknown 

Unfortunately, the value of the parameter C is also unknown in most
cases. Johnson and Kotz⁴ (Chapter 18, section 5) suggest the smallest
of $X_1, X_2, \ldots, X_n$ as an estimate for C. If this value is denoted
by L = C+a (a must be positive by the nature of the exponential distribution),
then

\[ \mathrm{Prob}[X \le L+t(\bar{X}-L)] = \mathrm{Prob}[X \le C+t(\bar{X}-C)+a(1-t)] < \mathrm{Prob}[X \le C+t(\bar{X}-C)] = 1 - [1+(t/n)]^{-n} \tag{9} \]

so long as t>1. In order for Equation (9) to provide a "conservative"
estimate, the inequality would have to be reversed. However, since the
reason for the inequality is the difference between L and C, the right
hand side of Equation (9) provides a good estimate so long as the physical
situation prevents the data values from dropping below L.

2.3.3  Use  Decision 

This paragraph describes the criteria used in CHECK to decide whether
or not a set of bounds based on an exponential assumption is to be recommended,
and if so, which set. Any recommended bounds would be included
on TAPE2 (see subsection 3.3). The criteria presented here are based on
the professional judgment of the author and the requirements of the application
for which CHECK was first written.

CHECK contains a variable, PCIND, whose value determines whether exponential
bounds are to be recommended. The value of PCIND is initially
set according to the result of the large sample test (see paragraph 2.2.1).
If $R^2$ of Equation (1) is less than

\[ FN = \begin{cases} \text{(value not legible in the source)} & \text{for } n \le 5 \\ 0.30255 \ln n - 1.8237537\, n^{0.09} + 2.2946 & \text{for } 5 < n < 397 \\ 0.98 & \text{for } n \ge 397 \end{cases} \tag{10} \]

PCIND is set equal to minus one, ultimately rejecting a recommendation
of exponential bounds. If $R^2$ is larger than or equal to FN,
PCIND is initially set according to

\[ PCIND = \left(1 - e^{[0.04(75-n)]^{1/3} - 1.44224957}\right) R^2/2 + 0.05\,(R^2-FN)/(1-FN) \tag{11} \]

where the cube root carries the sign of its argument. Figure 1 is a plot
of FN (Equation (10)). Figure 2 is a plot of the
first term of Equation (11). This figure indicates that, for $R^2$ close to
one, PCIND will start out close to 1/2 for large sample sizes.

Next, PCIND is increased by the significance level of the uniformity
test (see paragraph 2.2.2). Finally, PCIND is set equal to zero if
the sample standard deviation (s of Equation (4)) is more than 1.8 times
$\bar{X}-L$ (see Equation (9)), or if $\bar{X}-L$ is more than 1.8 times s (see paragraph
2.2.3). Exponential bounds are recommended if PCIND is at least 1/2.

The recommended bounds will be such that the expected probability is
at least 0.9775 that another independent draw from the same presumed
exponential distribution will lie within the bounds.
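The scoring described in this paragraph can be sketched in Python as below. The function names are hypothetical. Two points are assumptions drawn from the Appendix B listing rather than from the text: the large sample test is skipped (PCIND left at zero) when n is 5 or less, and the constant in the exponent is taken as the cube root of 3, which makes the first term of Equation (11) vanish at n = 0.

```python
import math

def fn_threshold(n):
    """FN of Equation (10) for n > 5; the formula is capped at 0.98."""
    return min(0.30255 * math.log(n) - 1.8237537 * n ** 0.09 + 2.2946, 0.98)

def initial_pcind(r2, n):
    """Initial value of PCIND from the large sample test."""
    if n <= 5:
        return 0.0                     # assumption from the listing: test skipped
    fn = fn_threshold(n)
    if r2 < fn:
        return -1.0                    # ultimately rejects exponential bounds
    d = 0.04 * (75 - n)
    cube_root = math.copysign(abs(d) ** (1.0 / 3.0), d)   # signed real cube root
    return (1.0 - math.exp(cube_root - 1.44224957)) * r2 / 2 + 0.05 * (r2 - fn) / (1 - fn)

def recommend_exponential(r2, n, uniformity_sig, s, mean_minus_min):
    """Apply the three criteria; True when exponential bounds would be recommended."""
    pcind = initial_pcind(r2, n) + uniformity_sig
    if max(s, mean_minus_min) > 1.8 * min(s, mean_minus_min):
        pcind = 0.0                    # two-moment test overrides everything else
    return pcind >= 0.5
```

With a perfect fit and a large sample the initial value is slightly above 1/2, so a poor uniformity result alone does not block the recommendation, whereas a failed two-moment comparison always does.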

2.4  THE  NORMALITY  CHECKER 

The normality checker uses a skewness test, the Shapiro-Wilk test
(for samples of size fifty or less), and a transformed D statistic test (for
samples of size larger than fifty) to test for normality. Shapiro et al.,
in a comparative study of nine different tests, found that the Shapiro-Wilk
test was the best overall test for normality, particularly for small
sample sizes, but that the skewness test was much better against skewed
alternatives. The transformed D statistic test used here was introduced
by D'Agostino several years later and, therefore, was not included
among the nine tests compared by Shapiro et al.


[Figure 2: plot of the first term of Equation (11)]

2.4.1 A Skewness Test

The skewness test in CHECK uses the approximate distribution of a
transformation of the sample skewness. The transformation was introduced
by Johnson. For

\[ \sqrt{b_1} = \frac{\sum_{i=1}^{n}(X_i-\bar{X})^3/n}{\left[\sum_{i=1}^{n}(X_i-\bar{X})^2/n\right]^{3/2}} \tag{12} \]

and

\[ W^2 = -1 + \left\{ 2\left[ \frac{3(n^2+27n-70)(n+1)(n+3)}{(n-2)(n+5)(n+7)(n+9)} - 1 \right] \right\}^{1/2} \tag{13} \]

D'Agostino⁸ proved that the distribution of

\[ Z = R \ln[(y/a)^{1/2} + (1 + y/a)^{1/2}] \tag{14} \]

is approximately standard normal (i.e., with mean zero and variance one)
if $X_1, X_2, \ldots, X_n$ is a random sample from a normal distribution and

\[ R = (\ln W)^{-1/2} \tag{15} \]

\[ y = [b_1(n+1)(n+3)]/[6(n-2)] \tag{16} \]

\[ a = 2/(W^2-1) \tag{17} \]

This test can be used only on samples with at least eight data values.


To determine the significance level corresponding to a particular
value of Z, the following approximation (see Johnson and Kotz,⁴ page
55) to the standard normal distribution function is used:

\[ \mathrm{Prob}[X \le x] \approx 1 - 0.5(1 + 0.196854x + 0.115194x^2 + 0.000344x^3 + 0.019527x^4)^{-4} \tag{18} \]

for $x \ge 0$ and X having a standard normal distribution.
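A short Python sketch of the skewness test follows, combining Equations (12) through (18). The function names are hypothetical. The two-sided significance level returned by `skewness_significance` is an interpretation based on the Appendix B listing, which evaluates the fourth-power term of Equation (18) at the absolute value of Z.

```python
import math

def normal_cdf_approx(x):
    """Equation (18): approximate standard normal distribution function, x >= 0."""
    if x < 0:
        raise ValueError("approximation is stated for x >= 0 only")
    poly = 1 + x * (0.196854 + x * (0.115194 + x * (0.000344 + x * 0.019527)))
    return 1 - 0.5 * poly ** -4

def skewness_z(sample):
    """Z of Equation (14); non-negative by construction."""
    n = len(sample)
    if n < 8:
        raise ValueError("the skewness test needs at least eight data values")
    mean = sum(sample) / n
    m2 = sum((x - mean) ** 2 for x in sample) / n
    m3 = sum((x - mean) ** 3 for x in sample) / n
    b1 = m3 * m3 / m2 ** 3                                   # square of Equation (12)
    beta2 = 3 * (n * n + 27 * n - 70) * (n + 1) * (n + 3) / ((n - 2) * (n + 5) * (n + 7) * (n + 9))
    w2 = -1 + math.sqrt(2 * (beta2 - 1))                     # Equation (13)
    r = 1 / math.sqrt(0.5 * math.log(w2))                    # Equation (15), ln W = ln(W^2)/2
    y = b1 * (n + 1) * (n + 3) / (6 * (n - 2))               # Equation (16)
    a = 2 / (w2 - 1)                                         # Equation (17)
    return r * math.log(math.sqrt(y / a) + math.sqrt(1 + y / a))

def skewness_significance(sample):
    """Two-sided significance level of the skewness test."""
    return 2 * (1 - normal_cdf_approx(skewness_z(sample)))
```

A perfectly symmetric sample gives Z = 0 and a significance level of one; a single large outlier drives Z well above 2.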

2.4.2 The Shapiro-Wilk Test

If $X_1, X_2, \ldots, X_n$ is a random sample arranged in ascending order,
the Shapiro-Wilk test statistic is given by

\[ W = \left(\sum_{i=1}^{n} a_{in}X_i\right)^2 \Big/ \sum_{i=1}^{n}(X_i-\bar{X})^2 \tag{19} \]

where the vector $a_n' = (a_{1n}, \ldots, a_{nn})$ is given by

\[ a_n' = m'V^{-1}/(m'V^{-1}V^{-1}m)^{1/2} \tag{20} \]

and where m is the vector of expected values, and V the covariance matrix,
of the order statistics of a sample of size n from a standard normal
distribution. The properties of this statistic were first examined by
Shapiro and Wilk, who tabulated the values $a_{in}$ for n from 2 through
50. Among the properties of W, if the $X_i$'s are from a normal distribution
with any mean and variance, is that the value W has a lower bound of

\[ E_n = n a_{1n}^2/(n-1) \tag{21} \]

and an upper bound of one. In CHECK the $a_{in}$'s are in an array called A
and the $E_n$'s are in an array called EPSILH. The significance level for
a given value of W is found by applying the approximation, given in
Equation (18), of the standard normal distribution to the following
transformation of W:

\[ Z_n = D_n \ln[(W-E_n)/(1-W)] + G_n \tag{22} \]

The values of $D_n$ and $G_n$ are in arrays DELTA and GAMMA, respectively, in
CHECK.


2.4.3 The Transformed D Statistic Test

If $X_1, X_2, \ldots, X_n$ is a random sample arranged in ascending order,
the D statistic is given by

\[ D = \sum_{i=1}^{n} [i - (n+1)/2]\, X_i \Big/ \left\{ n^{3/2} \left[\sum_{i=1}^{n}(X_i-\bar{X})^2\right]^{1/2} \right\} \tag{23} \]

For values of n from 50 to 1000 for various significance levels,
D'Agostino has tabulated the critical points of the following transformation
of D

\[ Y = \sqrt{n}\,(D - 0.28209479)/0.02998598 \tag{24} \]

under the assumption that the random sample is from an arbitrary normal
distribution. A curve-fitting routine was used to derive functions for
the 5 percent, 10 percent, 90 percent, and 95 percent significance levels.
CHECK uses these functions to determine, for a given sample size, the
critical points corresponding to these four significance levels. A
judgment concerning the normality of the data is then based on a comparison
of the sample value for Y, the transformed D statistic, with
these four critical points.
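The D statistic and its transformation can be computed as in the sketch below. The function names are hypothetical, and the fitted critical-point functions are not reproduced because several of their coefficients are not legible in the Appendix B listing.

```python
import math

def d_statistic(sample):
    """D of Equation (23), computed on the ordered sample."""
    xs = sorted(sample)
    n = len(xs)
    mean = sum(xs) / n
    numerator = sum((i - (n + 1) / 2) * x for i, x in enumerate(xs, start=1))
    denominator = n ** 1.5 * math.sqrt(sum((x - mean) ** 2 for x in xs))
    return numerator / denominator

def transformed_d(sample):
    """Y of Equation (24)."""
    n = len(sample)
    return math.sqrt(n) * (d_statistic(sample) - 0.28209479) / 0.02998598
```

D does not change when the data are shifted or rescaled by a positive factor, so the test depends only on the shape of the sample. Equally spaced data (a uniform shape) give D above 0.28209479 and therefore a positive Y.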

2.5  THE  NORMAL  BOUNDS  CALCULATOR 

2.5.1  Calculation  of  Bounds 

If $X_1, X_2, \ldots, X_n$ is a random sample from a normal distribution,
then for

\[ g = [(n+1)/n]^{1/2}\, t(1-p/2) \tag{25} \]

where t(1-p/2) is the (1-p/2) critical point of a Student's t distribution
with n-1 degrees of freedom, the probability that another independent
draw from the same normal distribution will lie between $\bar{X}-gs$ and $\bar{X}+gs$ is
1-p (see Johnson and Kotz,⁴ p. 74). The same result holds if

\[ g^2 = [(n+1)/n]\, F(1-p) \tag{26} \]

where F(1-p) is the (1-p) critical point of an F distribution with one
and n-1 degrees of freedom. CHECK uses the IMSL³ routine MDFD to
determine the significance level for $n g^2/(n+1)$.
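The relationship in Equation (25) can be checked numerically without a statistical library. The sketch below is not the MDFD routine; as a stand-in it integrates the Student's t density with Simpson's rule to obtain the probability that a future value lies within $\bar{X} \pm gs$. All names are hypothetical.

```python
import math

def t_two_sided_prob(a, dof, steps=2000):
    """Prob(|T| <= a) for Student's t with dof degrees of freedom (Simpson's rule)."""
    log_c = math.lgamma((dof + 1) / 2) - math.lgamma(dof / 2) - 0.5 * math.log(dof * math.pi)

    def pdf(x):
        return math.exp(log_c - (dof + 1) / 2 * math.log1p(x * x / dof))

    h = a / steps                                  # steps must be even for Simpson's rule
    total = pdf(0.0) + pdf(a)
    for k in range(1, steps):
        total += (4 if k % 2 else 2) * pdf(k * h)
    return 2 * total * h / 3                       # density is symmetric about zero

def normal_bounds(mean, s, n, g):
    """Bounds mean -/+ g*s and the probability that a future draw falls inside."""
    prob = t_two_sided_prob(g * math.sqrt(n / (n + 1)), n - 1)
    return mean - g * s, mean + g * s, prob
```

Dividing g by $[(n+1)/n]^{1/2}$ recovers the t critical point, which is why the probability is evaluated at $g[n/(n+1)]^{1/2}$ with n-1 degrees of freedom.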

2.5.2  Use  Decision 

This paragraph describes the criteria used in CHECK to decide whether
or not a set of bounds based on a normal assumption is to be recommended,
and if so, which set. Any recommended bounds would be included on TAPE2
(see subsection 3.3). The criteria presented here are based on the
professional judgment of the author and the requirements of the application
for which CHECK was first written.

The decision to recommend normal bounds or not is based on the results
of the skewness test discussed in paragraph 2.4.1 and, depending on the
size of the data set, on the results of either the Shapiro-Wilk test (see
paragraph 2.4.2) or the transformed D statistic test (see paragraph
2.4.3). As in the exponential bounds calculator (see paragraph 2.3.3),
CHECK uses the value of PCIND to determine whether normal bounds are to
be recommended.

If the number of data values is less than eight, the skewness test
cannot be used and normal bounds are recommended only if the significance
level of the Shapiro-Wilk test is at least 0.5. For larger samples,
normal bounds are recommended if the significance of the skewness test
is at least 0.9 and either the significance level of the Shapiro-Wilk test
is at least 0.48 (50 data values or less) or the D statistic lies between
the 10 percent and 90 percent significance values (more than 50 data
values). Alternatively, normal bounds will be recommended if the significance
of the skewness test is at least 0.75, and either the significance
level of the Shapiro-Wilk test is at least 0.5 or the D statistic lies
between the 5 percent and 95 percent significance values. As in the
exponential case, any recommended bounds will be such that the expected
probability is at least 0.9775 that another independent draw from the same
presumed normal distribution will lie within the bounds.

2.6  THE  RANGE  BOUNDS  CALCULATOR 

If $X_1, X_2, \ldots, X_n$ is a random sample from any continuous distribution
arranged in ascending order, the expected value of the probability
P that some future independent draw from the same distribution will lie
between $X_s$ and $X_t$, where s<t, is given by

\[ E(P) = (t-s)/(n+1) \tag{27} \]

(see Wilks). In particular, the expected probability that some
future draw will be between $X_1$ and $X_n$ is (n-1)/(n+1).
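Equation (27) needs no distributional assumption, so it reduces to a one-line function. The name below is hypothetical.

```python
def expected_coverage(s, t, n):
    """E(P) of Equation (27) for the interval between order statistics X_s and X_t."""
    if not 1 <= s < t <= n:
        raise ValueError("order statistic indices must satisfy 1 <= s < t <= n")
    return (t - s) / (n + 1)
```

Taking s = 1 and t = n gives the full sample range, for example 6/8 with seven data values and 18/20 with nineteen.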
P is actually a random variable. As n increases, the expected value
of P increases rapidly. However, the probability that P is within 0.005 of
its expected value (i.e., accuracy in the second decimal) starts low and
increases slowly. Figure 3 exemplifies these statements. Some of the
detail lost in Figure 3 is summarized in Table 1. Note from Table 1 that
the expected value of P is already 0.75 with just seven data points, but
the probability that the actual value of P would round to 0.75 is only 2.5
percent. Similarly, E(P) is 0.9 at nineteen data points, but the probability
that the actual value would round to 0.90 is only 5.7 percent. The
probability that P is within 0.005 of its expected value doesn't reach 90
percent until the number of data points is about 390. By that time the
expected value of P is nearly 0.995.
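The figures quoted above can be reproduced from the distribution of P. The sketch relies on a standard result that is not stated in the report: for a continuous distribution, the coverage of the sample range follows a beta distribution with parameters n-1 and 2, whose distribution function is $np^{n-1} - (n-1)p^n$. Function names are hypothetical.

```python
def coverage_cdf(p, n):
    """Distribution function of P = Prob[X_1 <= X <= X_n], a Beta(n-1, 2) variable."""
    return n * p ** (n - 1) - (n - 1) * p ** n

def prob_near_expectation(n, tol=0.005):
    """Probability that P lies within tol of its expected value (n-1)/(n+1)."""
    expected = (n - 1) / (n + 1)
    upper = min(expected + tol, 1.0)      # P cannot exceed one
    lower = max(expected - tol, 0.0)
    return coverage_cdf(upper, n) - coverage_cdf(lower, n)
```

Evaluating this at n = 7, 19, and 390 gives roughly 0.025, 0.057, and 0.90, in line with the values cited from Table 1.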

Because of the requirements of the application for which CHECK was
first written, $X_1$ and $X_n$ are included on TAPE2 (see subsection 3.3) as a
suggested set of range bounds for each data set analyzed.


Figure 3 - Properties of $P = \mathrm{Prob}[X_1 \le X \le X_n]$ (horizontal axis: number of data values, n)

3. USE OF CHECK

3.1  INPUT 

The input to CHECK is a series of data sets to be examined. Each
data set has the following form:

o Title

o Data

o End indication

The title must be contained in the first thirty columns of the first
record of the data set. Each data value must be contained on a separate
record and within the first thirty columns of its record. The data value
is considered complete at the first blank after at least one digit (see
subsection 2.1). The end-of-data-set indicator is a string of ten or
more minus signs and/or decimal points contained in the first thirty
columns of a record. No data value or title may be contained on the
same record as the end-of-data-set indicator.

As presented in Appendix B, CHECK reads all input from a file called
TAPE3. This may be an attached permanent file or a local file created
prior to execution of the program. If it is desired to read the input
from cards, the PROGRAM card (the first card in CHECK) should be changed
to replace "TAPE3," with "TAPE3=INPUT," (see the listing of CHECK in
Appendix B). Sample input is given in Appendix C.

3.2  MECHANICS  OF  USE 

Use of CHECK on the CDC 6000 computers at DTNSRDC is implemented
through the following set of control cards:

Card                 Description

Job card             Standard
Charge card          Standard
FTN card             Loads CHECK
ATTACH,TAPE3.        Attaches the data set
ATTACH,IMSL.         Attaches the IMSL library
LDSET,LIB=IMSL       Loads the IMSL library. This is necessary since the
                     IMSL routine MDFD is used.
LGO.                 Executes the program

If CHECK is provided on a permanent file (say, one called CHECK) rather
than in card form, the FTN card can be replaced by

ATTACH,TEMP,CHECK,...
FTN(I=TEMP)

If a compiled version of CHECK is available on a permanent file (again
one called CHECK), the FTN card may be replaced by

ATTACH,LGO,CHECK,...

If the input data are provided on cards rather than on a permanent file,
and if changing CHECK as specified in subsection 3.1 is undesirable,
the ATTACH,TAPE3,... card may be replaced by

COPYBF(INPUT,TAPE3)
REWIND,TAPE3.

If the summary information on TAPE2 (see subsection 3.3) is to be cataloged
as a permanent file,

REQUEST,TAPE2,*PF.

should be included after the FTN card and

CATALOG,TAPE2,...

should be included after the LGO card. If this summary information is to be
printed instead, the following cards should be included after the LGO card:

REWIND,TAPE2.
COPYSBF(TAPE2,OUTPUT)
3.3 OUTPUT

The output of CHECK is in two parts, printed and file. The printed
output for each data set consists of

o The data set title

o Error messages

o The number of blank fields

o The data values in ascending order

o The number of data values

o The average of the data values

o The standard deviation of the data values

o The results of the large sample test for exponentiality
(see paragraph 2.2.1)

o The results of the uniformity test for exponentiality
(see paragraph 2.2.2)

o An indication of the bounds to be used if an exponential
assumption is to be made, including a table of such bounds
(printed only if an exponential assumption is not unrealistic)

o The results of the skewness test for normality (see paragraph
2.4.1) if the number of data values is at least eight

o The results of the Shapiro-Wilk test for normality (see
paragraph 2.4.2) if the number of data values is no larger
than fifty

o The result of the transformed D statistic test for normality
(see paragraph 2.4.3) if the number of data values is
larger than fifty

o An indication of the bounds to be used if a normal assumption
is to be made, including a table of such bounds
(printed only if a normal assumption is not unrealistic)

o One or more sets of range bounds

The file output is a summary of the printed output. Included in the
file output for each data set are a line for exponential bounds if the
three tests discussed in subsection 2.2 indicate that an exponential
assumption is reasonable, a line for normal bounds if the tests discussed
in subsection 2.4 indicate that a normal assumption is reasonable, and a
line for range bounds. Each line of the file output contains
o The data set title

o The number of data points

o The average of the data values

o The standard deviation of the data values

o Suggested lower and upper bounds

o Expected probability that a future value would lie within
the bounds, given this distributional assumption

o An indication of the distributional assumption (E for
exponential, N for normal, R for range or no distributional
assumption)

File output from CHECK is provided in the form of a local file called
TAPE2. Disposition of TAPE2 is discussed in subsection 3.2.

Samples of both printed and file output are given in Appendix C.
APPENDIX A
GLOSSARY

CDC                 Control Data Corporation
DEC                 Digital Equipment Corporation
DTNSRDC             David W. Taylor Naval Ship Research and
                    Development Center
exp                 Exponential function; exp(a) = e^a
IMSL                International Mathematical and Statistical
                    Libraries
ln                  Natural (base e) logarithm function
MDFD                IMSL statistical subroutine
MRC                 Maintenance Requirement Card
NAVSEC              Naval Ship Engineering Center
pdf                 Probability density function
SMMS                Shipsystem Maintenance Monitoring and Support
SST                 SMMS Site Team
standard normal     Normal distribution with mean zero and
distribution        variance one
APPENDIX B
PROGRAM LISTING

      PROGRAM CHECK(INPUT,OUTPUT,TAPE6=OUTPUT,TAPE3,TAPE2)
C     USE THIS ONE FOR GENERAL DATA SETS
C     REPORT DATA WILL BE GENERATED
      DIMENSION X(1200),C(14),FLD(30),TITL(3)
      DATA (C(I),I=1,14)/1H0,1H1,1H2,1H3,1H4,1H5,1H6,1H7,1H8,1H9,
     1 1H.,1H-,1H ,1H+/
      REWIND 3
      N=0
C***CHANGE THE FOLLOWING CARD AS NECESSARY TO MATCH THE DIMENSION
C***OF X.
      NX=1200
      WRITE(6,3000)
      NUM=0
C     *** TITLE READER ***
   10 READ(3,5003) (TITL(I),I=1,3)
      IF(EOF(3)) 420,20
   20 NUM = NUM + 1
      WRITE(6,3003) NUM
      WRITE(6,5001) (TITL(I),I=1,3)
      N=0
      NBFLD=0
C     *** DATA READER ***
   30 READ(3,5000) (FLD(I),I=1,30)
      NCOL=0
      N=N+1
      NBLANK=0
      NEND=0
      SIGN=1.0
      BUILD=0.0
      WHOLE=10.0
      FR=1.0
      FRACT=1.0
      IDG=0
C     *** DATA INTERPRETER
   40 NCOL=NCOL+1
      CCOL=FLD(NCOL)
C     IF ALL COLUMNS HAVE BEEN READ, GO TO 70
      IF(NCOL.GT.30) GO TO 70
C     IF CURRENT COLUMN IS BLANK, GO TO 120
      IF(CCOL.EQ.C(13)) GO TO 120
C     IF CURRENT COLUMN HAS MINUS, GO TO 110
      IF(CCOL.EQ.C(12)) GO TO 110
C     IF CURRENT COLUMN HAS DECIMAL, GO TO 90
      IF(CCOL.EQ.C(11)) GO TO 90
C     CHECK TO SEE IF CURRENT COLUMN HAS A DIGIT
      DO 50 I=1,10
      IF(CCOL.NE.C(I)) GO TO 50
      NEND=0
      IDG=I
      FRACT=FRACT*FR
      BUILD = BUILD*WHOLE + FRACT*FLOAT(I-1)
      GO TO 40
   50 CONTINUE
C     IF CURRENT COLUMN HAS PLUS, GO TO 40
      IF(CCOL.EQ.C(14)) GO TO 40
   60 WRITE(6,5005) (FLD(I),I=1,30)
      WRITE(6,5002)
      N=N-1
      GO TO 30
C     BUILD THE NUMBER
   70 BUILD = BUILD * SIGN
      IF(NBLANK.GE.30) GO TO 80
      IF(IDG.EQ.0) GO TO 60
      IF(N.LE.NX) X(N)=BUILD
      GO TO 30
   80 NBFLD=NBFLD+1
      N=N-1
      GO TO 30
C     TRANSFER FROM INTEGER PART TO FRACTIONAL PART
   90 IF(WHOLE.LT.5.0) GO TO 100
      WHOLE=1.0
      FR=0.1
  100 NEND=NEND+1
      IF(NEND.GE.10) GO TO 130
      GO TO 40
C     MAKE THE NUMBER NEGATIVE
  110 SIGN=-1.0
      NEND=NEND+1
      IF(NEND.GE.10) GO TO 130
      GO TO 40
C     COUNT THE BLANKS
  120 NBLANK=NBLANK+1
      IF(IDG)40,40,70
C     *** EXPONENTIAL CHECKER ***
  130 N=N-1
      IF(N.LT.3) GO TO 380
      IF(N.GT.NX) GO TO 370
      M=N/2
      IF(M.LT.1) M=1
      X1=0.0
      X2=0.0
      IF(N.GT.2*M) X2=X(N)
      DO 140 I=1,M
      X1=X1+X(I)
  140 X2=X2+X(I+M)
      X1=(X1/FLOAT(M))
      K=N-M
      X2=(X2/FLOAT(K))
      Q=FLOAT(N)
      CALL WSTEST(X,N,Y,P,Z,AVE,STDEV,R2)
      IF(NBFLD.NE.0) WRITE(6,4013) NBFLD
      WRITE(6,4000) (X(I),I=1,N)
      WRITE(6,4001) N,AVE,STDEV
      IF(STDEV.LE.0.0) GO TO 390
C     LARGE SAMPLE TEST FOR EXPONENTIALITY
      WRITE(6,4022) R2
      PCIND=0.0
      IF(N.LE.5) GO TO 150
      FN=AMIN1(0.30255*ALOG(Q)-1.8237537*Q**0.09+2.2946,0.98)
      PCIND=-1.0
      SGN=-1.0
      IF(N.LT.75) SGN=1.0
      IF(R2.GE.FN) PCIND=(1.0-EXP(SGN*(0.04*ABS(FLOAT(N-75)))**(1.0/3.0)
     1-1.44224957))*0.5*R2+0.05*(R2-FN)/(1.0-FN)
  150 X1=X1-X(1)
      X2=X2-X(1)
      F=0.0
      IF(ABS(X2).GT.0.0000001) F=X1/X2
      M=2*M
      K=2*K
      CALL MDFD(F,M,K,PF,IER)
      IF(IER.GE.128) PF=0.0
      IF(PF.GT.0.5) PF=1.0-PF
      PF=2.0*PF
      PCIND=PCIND+PF
C     UNIFORMITY TEST FOR EXPONENTIALITY
      IF(PF.LT.0.1) GO TO 220
      IF(PF-0.5) 160,170,170
  160 WRITE(6,4007)
      GO TO 180
  170 WRITE(6,4008)

180  URITE (6,4009)  X(1),60E,PF 
C     *** EXPONENTIAL BOUNDS CALCULATOR ***
  190 DIFF=AVE-X(1)
      WRITE(6,4027) X(1),DIFF,X(1)
      COMP=0.0
      BL=X(1)
      IF(AMAX1(DIFF,STDEV).GE.1.8*AMIN1(DIFF,STDEV)) PCIND=0.0
      DO 210 K=10,200
      T=0.1*FLOAT(K)
      PP=1.0-(1.0/(1.0+(T/Q))**Q)
      IF(PP.LE.COMP) GO TO 210
      BU=T*DIFF+BL
      COMP=PP+0.005
      WRITE(6,4003) T,PP,BL,BU
      IF(COMP.LE.0.9825.OR.PCIND.LT.0.5) GO TO 200
      PP=100.0*PP
C     PROVIDE EXPONENTIAL BOUNDS ON TAPE2
      WRITE(2,3501) (TITL(I),I=1,3),N,AVE,STDEV,BL,BU,PP
      PCIND=0.0
  200 IF(COMP.GE.1.0) GO TO 230
  210 CONTINUE
      GO TO 230
  220 WRITE(6,4010) PF,IER
      IF(IER.NE.0) GO TO 230
      IF(R2.GT.0.85.AND.N.GT.50) GO TO 190
      IF(PCIND.GE.0.5.AND.N.LE.50) GO TO 190
C     *** NORMAL CHECKER ***
  230 PCIND=0.3
      PZ=1.0
      IF(N.LT.8) GO TO 240
C     SKEWNESS TEST FOR NORMALITY
      Z=ABS(Z)
      PCIND=0.0
      PZ=1.0/(1.0+Z*(.196854+Z*(.115194+Z*(.000344+Z*.019527))))**4
      WRITE(6,4015) PZ
      IF(PZ.GE.0.75) PCIND=PCIND+0.3
      IF(PZ.GT.0.9) PCIND=PCIND+0.15
      IF((PZ-0.5)*(PZ-0.1).LT.0.0) WRITE(6,4004)
      IF(PZ.GE.0.5) WRITE(6,4005)
      IF(PZ.LE.0.1) WRITE(6,4006)
      IF(N.GT.50) GO TO 250
C     SHAPIRO-WILK TEST FOR NORMALITY
  240 WRITE(6,4014) P
      IF(P.GE.0.5) PCIND=PCIND+0.3
      IF(P.GE.0.48) PCIND=PCIND+0.15
      IF((P-0.5)*(P-0.1).LT.0.0) WRITE(6,4004)
      IF(P.GE.0.5) WRITE(6,4005)
      IF(P.LE.0.1) WRITE(6,4006)
      GO TO 310
C     TRANSFORMED D STATISTIC TEST FOR NORMALITY

250  P10=-1.3439065+.t99408805/EXP(.01*Q)-39.1)33537/Q+.24783607A/EXP(0 
1*.02)«634. 84298/0**2 
      P=1.0
      IF(Y.LT.P10) GO TO 280
      P90=1.21938688+.205750184/EXP(Q*.01)-47.655998/Q+.240379393/EXP(Q
     1*.02)+701.334854/Q**2
      IF(Y.GT.P90) GO TO 260
      WRITE(6,4016) Y,P10,P90
      PCIND=PCIND+0.3
      GO TO 300
  260 P95=1.55358157+.30118663/EXP(Q*.01)-67.593821/Q+.362900004/EXP(Q*.
     102)+1016.22535/Q**2
      IF(Y.GE.P95) GO TO 270
      WRITE(6,4017) Y,P90,P95
      PCIND=PCIND+0.15
      GO TO 300
  270 WRITE(6,4018) Y,P95
      P=0.0
      GO TO 300
  280 P05=-1.73657547+.29836828/EXP(Q*.01)-59.054124/Q+.34545809/EXP(Q*
     1.02)+977.76416/Q**2
      IF(Y.LE.P05) GO TO 290
      WRITE(6,4019) Y,P10,P05
      PCIND=PCIND+0.15
      GO TO 300
  290 WRITE(6,4020) Y,P05
      P=0.0
  300 IF((Y-0.5)*(Y-0.1).LT.0.0) WRITE(6,4004)
      IF(Y.GE.0.5) WRITE(6,4005)
      IF(Y.LE.0.1) WRITE(6,4006)
C *** NORMAL BOUNDS CALCULATOR ***
  310 IF(P.LT.0.1.OR.PZ.LT.0.25) GO TO 340
      WRITE(6,4002)
      ADDON=FLOAT(N)/FLOAT(N+1)
      M=N-1
      G=0.4
      COMP=0.0
      DO 330 K=5,200
      G=G+0.1
      FP=G*G*ADDON
      CALL MDFD(FP,1,M,PP,IER)
      IF(IER.GE.128.OR.PP.LE.COMP) GO TO 330
      BL=AVE-G*STDEV
      BU=AVE+G*STDEV
      COMP=PP+0.005
      WRITE(6,4003) G,PP,BL,BU
      IF(COMP.LE.0.985.OR.PCIND.LT.0.5) GO TO 320
      PP=100.0*PP
C PROVIDE NORMAL BOUNDS ON TAPE2
      WRITE(2,3502) (TITL(I),I=1,3),N,AVE,STDEV,BL,BU,PP
      PCIND=0.0
  320 IF(COMP.GE.1.0) GO TO 340
  330 CONTINUE

C *** RANGE BOUNDS CALCULATOR ***
  340 WRITE(6,4011)
      L=1
      J=N
      COMP=2.0
      BL=X(1)-1.0
      BU=X(N)+1.0
      DO 360 I=1,N
      IF(L.GE.J) GO TO 10
      PP=FLOAT(J-L)/(Q+1.0)
      IF(COMP-PP.LT.0.005.OR.X(L).LE.BL.OR.X(J).GE.BU) GO TO 350
      BL=X(L)
      BU=X(J)
      COMP=PP
      WRITE(6,4021) BL,BU,PP
      PP=100.0*PP
C PROVIDE RANGE BOUNDS ON TAPE2
      IF(I.EQ.1) WRITE(2,3503) (TITL(L),L=1,3),N,AVE,STDEV,BL,BU,PP
  350 L=L+1
  360 J=J-1
      GO TO 10
C *** ERROR STATEMENTS ***
  370 WRITE(6,3002) N,NX
      GO TO 10
  380 WRITE(6,4012) N
      IF(NBFLD.NE.0) WRITE(6,4013) NBFLD
      IF(N-1) 10,400,410
  390 PP=100.0*FLOAT(N-1)/FLOAT(N+1)
      WRITE(2,3503) (TITL(I),I=1,3),N,AVE,STDEV,AVE,AVE,PP
      GO TO 10
  400 WRITE(2,3504) (TITL(I),I=1,3),X(1)
      WRITE(6,4000) X(1)
      GO TO 10
  410 AVE=AMAX1(X(1),X(2))
      X(1)=AMIN1(X(1),X(2))
      X(2)=AVE
      AVE=0.5*(X(1)+X(2))
      WRITE(2,3505) (TITL(I),I=1,3),AVE,X(1),X(2)
      WRITE(6,4000) (X(I),I=1,N)
      GO TO 10
  420 STOP

 3000 FORMAT(1H1,* THIS PROGRAM IS TO DETERMINE IF A GIVEN SET*
     1,* OF DATA FITS NORMAL OR*/* EXPONENTIAL ASSUMPTIONS.*)
 3002 FORMAT(//* THIS PROBLEM CANNOT BE ATTACKED AT THIS TIME.*/
     1* THE TOTAL NUMBER OF DATA VALUES,*,I4,*, EXCEEDS THE DIMENSION OF
     2 X, WHICH IS *,I6/* IF YOU RERUN WITH X REDIMENSIONED, BE SURE TO
     3 ALSO CHANGE THE VALUE OF NX.*/)
 3003 FORMAT(///////* PROBLEM *,I3,* *//)
 3501 FORMAT(3A10,1X,I4,1X,2(F10.3,1X),2(F8.3,1X),F4.1,* E*)
 3502 FORMAT(3A10,1X,I4,1X,2(F10.3,1X),2(F8.3,1X),F4.1,* N*)
 3503 FORMAT(3A10,1X,I4,1X,2(F10.3,1X),2(F8.3,1X),F4.1,* R*)
 3504 FORMAT(3A10,4X,*1*,1X,F10.3)
 3505 FORMAT(3A10,4X,*2*,1X,F10.3,12X,2(F8.3,1X),5X,*R*)
 4000 FORMAT(* DATA:*,6(1X,F10.3))
 4001 FORMAT(//5X,*NUMBER OF POINTS: *,I5/5X,*AVERAGE VALUE (M): *,F14.6
     1/5X,*STANDARD DEVIATION (S): *,F14.6)
 4002 FORMAT(5X,*USE (M-KS) AS A LOWER BOUND AND (M+KS) AS AN UPPER BOUN
     1D*/5X,*WHERE THE EXPECTED PROPORTION OF POPULATION FOUND WITHIN TH
     2E LIMITS*/5X,*IS A FUNCTION OF K AS SPECIFIED IN THE FOLLOWING TAB
     3LE*//6X,*K*,4X,*PROPORTION OF POPULATION*,5X,*LOWER BOUND*,5X,
     4*UPPER BOUND*/)
 4003 FORMAT(4X,F4.1,12X,F9.6,12X,E13.7,3X,E13.7)
 4004 FORMAT(5X,*NORMAL ASSUMPTION QUESTIONABLE.*)
 4005 FORMAT(5X,*NORMAL ASSUMPTION HAS SOME MERIT.*)
 4006 FORMAT(5X,*NORMAL ASSUMPTION DOUBTFUL.*)
 4007 FORMAT(/5X,*EXPONENTIAL ASSUMPTION QUESTIONABLE WITH*)
 4008 FORMAT(/5X,*EXPONENTIAL ASSUMPTION PROMISING WITH*)
 4009 FORMAT(5X,*A=*,F16.6,5X,*AND MEAN=*,F16.6/5X,*SIGNIFICENCE LEVEL
     1 IS *,F9.6)
 4010 FORMAT(/3X,*EXPONENTIAL ASSUMPTION DOUBTFUL SINCE SIGNIFICENCE LEV
     1EL IS *,F9.6/5X,*(MDFD ERROR CODE *,I3,*)*)
 4011 FORMAT(////3X,*IF NORMAL AND EXPONENTIAL ASSUMPTIONS CANNOT BE MAD
     1E OR ARE TOO RISKY,*)
 4012 FORMAT(/5X,*THE NUMBER OF DATA POINTS,*,I8,*, IS TOO SMALL.*
     1/5X,*THE PROBLEM IS NOT TACKLED.*)
 4013 FORMAT(//* THE NUMBER OF BLANK FIELDS IS *,I6/)
 4014 FORMAT(//5X,*THE SHAPIRO-WILK SIGNIFICANCE LEVEL IS *,F12.6)
 4015 FORMAT(//5X,*SIGNIFICENCE LEVEL OF SKEWNESS TEST IS *,F12.6)
 4016 FORMAT(//5X,*THE VALUE,*,E13.6,*, FOR THE TRANSFORMED D STATISTIC
     1 LIES WITHIN THE*/5X,*80 PERCENT LIMITS OF *,E13.6,* AND *,E13.6)
 4017 FORMAT(//5X,*THE VALUE,*,E13.6,*, FOR THE TRANSFORMED D STATISTIC
     1 LIES ABOVE THE*/5X,*80 PERCENT LIMIT,*,E13.6,*, BELOW THE 90 PERCE
     2NT LIMIT,*,E13.6)
 4018 FORMAT(//5X,*THE VALUE,*,E13.6,*, FOR THE TRANSFORMED D STATISTIC
     1 LIES ABOVE THE*/5X,*90 PERCENT LIMIT OF *,E13.6)
 4019 FORMAT(//5X,*THE VALUE,*,E13.6,*, FOR THE TRANSFORMED D STATISTIC
     1 LIES BELOW THE*/5X,*80 PERCENT LIMIT,*,E13.6,*, ABOVE THE 90 PERCE
     2NT LIMIT,*,E13.6)
 4020 FORMAT(//5X,*THE VALUE,*,E13.6,*, FOR THE TRANSFORMED D STATISTIC
     1 LIES BELOW THE*/5X,*90 PERCENT LIMIT OF *,E13.6)
 4021 FORMAT(/5X,*USE *,E14.6,* AS A LOWER BOUND AND *,E16.6,* AS AN*
     1/5X,*UPPER BOUND.  EXPECTED PROPORTION OF POPULATION BOUNDED IS*
     2,F9.6)
 4022 FORMAT(//5X,*SIGNIFICANCE LEVEL OF LARGE SAMPLE TEST FOR EXPONENTI
     1ALITY IS*,F9.6/5X,*THIS RESULT SHOULD NOT BE USED IF THE NUMBER O
     2F DATA POINTS IS LESS THAN*/5X,*30.  ITS SIGNIFICANCE SHOULD BE TA
     3KEN WITH A GRAIN OF SALT IF THE NUMBER*/5X,*OF DATA POINTS IS LESS
     4 THAN 100.  FOR 100 POINTS OR MORE*/5X,*EXPONENTIALITY IS DOUBTFUL
     5 IF THE SIGNIFICANCE LEVEL IS LESS*/5X,*THAN .96 AND PROMISING IF
     6 IT EXCEEDS .99.*/)
 4027 FORMAT(//3X,*USE*,F16.6,5X,*AS THE LOWER BOUND AND */
     23X,F10.6,*K + *,F14.6,5X,*AS THE UPPER BOUND*/5X,*THE EXPECTED PRO
     3PORTION OF THE POPULATION FALLING WITHIN THESE*/5X,*BOUNDS DEPENDS
     4 UPON K AS INDICATED IN THE FOLLOWING TABLE*//6X,*K*,4X,*PROPORTIO
     5N OF POPULATION*,5X,*LOWER BOUND*,5X,*UPPER BOUND*/)
 5000 FORMAT(30A1)
 5001 FORMAT(3X,3A10//)
 5002 FORMAT(* THE NUMBER IN THIS FIELD IS NOT INCLUDED IN THE DATA SET.
     1*/)
 5003 FORMAT(3A10)
 5004 FORMAT(* CONTENTS OF FIELD: *,30A1)
      END


      SUBROUTINE WSTEST(X,N,Y,P,Z,AVE,STDEV,R2)
C *** SUBROUTINE TO APPLY THE W TEST FOR NORMALITY (COMPLETE SAMPLES)
C     OF SHAPIRO AND WILK
C     X IS A SET OF DATA TO BE TESTED FOR NORMALITY
C     N IS THE NUMBER OF ITEMS IN ARRAY X
C     X WILL BE SORTED IN INCREASING ORDER X(1).LE.X(2).LE.X(3)...
C     RESULT W IS THE TEST STATISTIC OF SHAPIRO AND WILK
C     P IS THE SIGNIFICANCE LEVEL ASSOCIATED WITH W
C     RESTRICTIONS 3.LE.N.LE.50
      DIMENSION X(N)
      DIMENSION INDEXA(51),A(625),GAMMA(50),DELTA(50),EPSILN(50)
C     INDEXA(N) IS THE INDEX OF THE FIRST ITEM IN ARRAY *A* TO BE USED
      DATA(INDEXA(I),I=2,51) / 1, 2, 3, 5, 7, 10, 13, 17, 21
     1, 26, 31, 37, 43, 50, 57, 65, 73, 82, 91,101,111,122,133,145,157
     2,170,183,197,211,226,241,257,273,290,307,325,343,362,381,401,421
     3,442,463,485,507,530,553,577,601,626/

DATA(A(I),I>1, 210)/. 7071, .7071, .6872,. 1677,. 6646,. 2413, .6431,. 2806 

1..  0875.. 6233.. 3031.. 1401.. 6052.. 3164.. 1743. .0561. .5888.. 3244.. 1976 

2..  0947.. 5739.. 3291.. 2141.. 1224.. 0399.. 5601.. 3315. .2260.. 1429.. 0695 

3..  5475.. 3325.. 2347.. 1586.. 0922.. 0303.. 5359. .3325. .2412.. 1707.. 1099 

4..  0539.. 5251.. 3318.. 2460.. 1802.. 1240.. 0727. .0240.. 5150.. 3306.. 2495 

5..  1878.. 1353.. 0880.. 0433.. 5056.. 3290.. 2521 ..1939.. 1447.. 1005.. 0593 

6..  0196.. 4968.. 3273.. 2540.. 1988.. 1524.. 11 09.. 0725.. 0359.. 4886. .3253 

7..  2553.. 2027.. 1587.. 1197.. 0837.. 0496.. 0163.. 4808.. 3232.. 2561.. 2059 

8..  1641,.  1271,. 0932,. 0612,. 0303,. 4734,. 3211  , .2565, .2085,. 1686, .1334 

9..  1013.. 0711.. 0422.. 0140.. 4643.. 3185.. 2578. .21 19. .1736.. 1399.. 1092 

A, . 0804,. 0530,. 0263,. 4590,. 3156,. 2571,. 2131,. 1764,. 1443,. 1150,. 0878 

B, . 0618,. 0368,. 0122,. 4542,. 3126,. 2563,. 2139,. 1787,. 1480,. 1201,. 0941 

C, . 0696,. 0459,. 0228,. 4493,. 3098,. 2554,. 2145,. 1807,. 1512,. 1245.. 0997 
B,. 0764,. 0539,. 0321,. 0107,. 4450,. 3069,. 2543,. 21 48,. 1822,. 1539,. 1283 

E, . 1046,. 0823,. 0610,. 0403,. 0200,. 4407,. 3043,. 2533, .2151,. 1836,. 1563 

F, . 1316,. 1089,. 0876,. 0672,. 0476,. 0284,. 0094,. 4366,. 3018,. 2522,. 2152 

8..  1848.. 1584.. 1346.. 1128.. 0923.. 0728.. 0540. .0358. .0178.. 4328.. 2992 
H,. 2510,. 21 51,. 1857,. 1601,. 1372,. 11 62,. 0965,. 0778,. 0598,. 0424, .0253 

1..  0084.. 4291.. 2968.. 2499.. 21 50.. 1864.. 161 6.. 1395.. 1192.. 1002.. 0822 
J,. 0650,. 0483,.  0320,. 01 59/ 

DATA( Ad),  1=21 1,420)  7.4254,. 2944,. 2487, .21 48, .1870,  .1630,  .1415 

1..  121 9,.  1036,. 0862,. 0697,. 0537,. 0381  ,. 0227,. 0076,. 4220,. 2921,. 2475 

2..  2145.. 1874.. 1641.. 1433.. 1243.. 1066.. 0899. .0739.. 0585. .0435. .0289 

3..  0144.. 41 88.. 2898.. 2463.. 21 41.. 1878.. 1651 ..1449.. 1265.. 1093.. 0931 

4..  0777.. 0629.. 0485.. 0344.. 0206.. 0068.. 41 56.. 2876.. 2451  ,.21 37,. 1880 

5..  1660..1463..1284..1 118.. 0961.. 0812.. 0669.. 0530.. 0395.. 0262.. 0131 

6..  41 27.. 2854.. 2439.. 21 32.. 1882.. 1667.. 1475.. 1301  ,.  1140,. 0988,. 0844 

7..  0706.. 0572.. 0441.. 031 4.. 01 87.. 0062.. 4096.. 2834.. 2427.. 21 27.. 1883 

8..  1673,.  1487,.  131 7,.  11 60,.  101 3,. 0873,. 0739,. 0610,. 0484,. 0361,. 0239 

9..  01 19.. 4068.. 281 3.. 241 5.. 21 21.. 1883.. 1678.. 1496.. 1331.. 1179.. 1036 

A, . 0900,. 0770,. 0645,. 0523,. 0404,. 0287,. 0172,. 0057, .4040,. 2794,. 2403 

B, . 21 16,. 1883,. 1683,. 1505,. 1344,. 11 96,. 1056,. 0924,. 0798,. 0677,. 0559 

C, . 0444,. 0331  ,. 0220,. 01 10,. 401 5,. 2774,. 2391  ,.21  10,. 1881  ,. 1686,. 1513 
B,. 1356,. 1211,. 1075,. 0947,. 0824,. 0706,. 0592,. 0481,. 0372,. 0264,. 0158 

E, . 0053,. 3989,. 2755,. 2380,. 21 04,. 1880,. 1689,. 1520,. 1366,. 1225,. 1092 

F, . 0967,. 0848,. 0733,. 0622,. 051 5,. 0409,. 0305,. 0203,. 01 01,. 3964,. 2737 

6..  2368.. 2098.. 1878.. 1691.. 1526.. 1376.. 1237. .1108.. 0986.. 0870.. 0759 
H,. 0651,. 0546,. 0444,. 0343,. 0244,. 0146,. 0049,. 3940, .2719, .2357,. 2091 

1..  1876.. 1693.. 1531.. 1384.. 1249.. 11 23.. 1004.. 0891.. 0782.. 0677.. 0575 
J,. 0476,. 0379,. 0283,. 0188,. 0094/ 

DATA(A(I), 1=421, 625)  /. 3917,. 2701 , .2345, .2085, .1874, .1694, .1535 

1..  1392.. 1259.. 1136.. 1020.. 0909.. 0804.. 0701.. 0602.. 0506.. 0411.. 0318 

2..  0227.. 01 36.. 0045.. 3894.. 2684.. 2334.. 2078. .1871.. 1695.. 1539.. 1398 

3..  1269.. 11 49.. 1035.. 0927.. 0824.. 0724.. 0628.. 0534.. 0442.. 0352. .0263 

4..  0175.. 0087.. 3872.. 2667.. 2323.. 2072.. 1868.. 1695.. 1542.. 1405.. 1278 

5..  1160.. 1049.. 0943.. 0842.. 0745.. 0651.. 0560.. 0471.. 0383.. 0296.. 0211 
     6,.0126,.0042,.3850,.2651,.2313,.2065,.1865,.1695,.1545,.1410,.1286

7..  1170..  1062,. 095?,. 0860,. 0765,. 0673,. 0584, .0497, .0412, .0328,. 0245 

8..  0163.. 0081.. 3830.. 2635.. 2302.. 2058.. 1862. .1695. .1548.. 1415.. 1293 

9 . .  1 1 80 . . 1 073 . . 0972 . . 0876 .. 0783 .. 0694 .. 0607 .. 0522 .. 0439. . 0357. . 0277 
A,. 0197,. 01 18,. 0039,. 3808,. 2620,. 2291,. 2052,. 1859, .1695,. 1550,. 1420 

1..  1300.. 1189.. 1085.. 0986.. 0892.. 0801.. 0713.. 0628. .0546. .0465.. 0385 

C, . 0307,. 0229,. 0153,. 0076,. 3789,. 2604,. 2281,. 2045, .1855, .1693,,. 1551 

D,  . 1 423 , . 1 306 , . 1 1 97 , . 1 095 , . 0998 , . 0906 , . i 1 7 , . 073 1 , . 0648, .0568, .0489 

E, . 0411,. 0335,. 0259,. 0185,. 01 11,. 0037,. ^770,. 2589, .2271,. 2038,. 1851 

F, . 1692,. 1553,. 1427,. 1312,. 1205,. 1105,. 1010, .0919, .0832, .0748, .0667 

6..  0588.. 0511.. 0436.. 0361.. 0288.. 0215.. 01 43.. 0071. .3751.. 2574. .2260 

H, . 2032,. 1847,. 1691,. 1554,. 1430,. 1317,. 1212,. 11 13,. 1020,. 0932,. 0846 

1..  0764.. 0685.. 0608.. 0532.. 0459.. 0386.. 0314. .0244. .0174.. 0104.. 0035 
J/ 

      DATA (GAMMA(I),I=3,50) /0.625,1.107,1.530,2.010,2.356,2.696,2.968
     1,3.262,3.485,3.731,3.936,4.155,4.373,4.567,4.713,4.885,5.018,5.153
     2,5.291,5.413,5.508,5.605,5.704,5.803,5.905,5.988,6.074,6.160,6.248
     3,6.324,6.402,6.480,6.559,6.640,6.721,6.803,6.887,6.961,7.035,7.111
     4,7.188,7.266,7.345,7.414,7.484,7.555,7.615,7.677/
      DATA (DELTA(I),I=3,50) /0.386,0.714,0.935,1.138,1.245,1.333,1.400
     1,1.471,1.515,1.571,1.613,1.655,1.695,1.724,1.739,1.770,1.786,1.802
     2,1.818,1.835,1.848,1.862,1.876,1.890,1.905,1.919,1.934,1.949,1.965
     3,1.976,1.988,2.000,2.012,2.024,2.037,2.049,2.062,2.075,2.088,2.101
     4,2.114,2.128,2.141,2.155,2.169,2.183,2.198,2.212/
      DATA (EPSILN(I),I=3,50) /.7500,.6297,.5521,.4963,.4533,.4186
     1,.3900,.3660,.3451,.3270,.3111,.2969,.2842,.2727,.2622,.2528,.2440
     2,.2359,.2264,.2207,.2157,.2106,.2063,.2020,.1980,.1943,.1907,.1872
     3,.1840,.1811,.1781,.1755,.1727,.1702,.1677,.1656,.1633,.1612,.1591
     4,.1572,.1552,.1534,.1516,.1499,.1482,.1466,.1451,.1436/
C *** CALCULATE THE W NORMALITY TEST STATISTIC OF SHAPIRO AND WILK
C *** NON-NORMAL RETURN FOR N .LT. 3
      W      =0.
      P      =0.
      IF (N.LT.3) RETURN
C *** SORT SAMPLE IN ASCENDING ORDER
      DO 930 I=2,N
      JMAX   =I-1
      YTEST  =X(I)
      IF (YTEST.GE.X(JMAX)) GO TO 930
      DO 900 IJ=1,JMAX
      IF (YTEST.LT.X(IJ)) GO TO 910
  900 CONTINUE
  910 DO 920 IK=IJ,JMAX
      II     =IJ+JMAX-IK
  920 X(II+1) =X(II)
      X(IJ)  =YTEST
  930 CONTINUE
      S=0.0
      R2=0.0
      DO 940 I=1,N
      R2=R2+ALOG(FLOAT(I))
  940 S=S+X(I)
      Q=FLOAT(N)
      AVE=S/Q
      R2=R2/Q
      SSQ=0.0
      Y3=0.0
      Y4=0.0
      Y6=0.0
      Y7=0.0
      Y5=Q+1.
      Y2=-Y5/2.0
      DO 950 I=1,N
      Y2=Y2+1.0
      Y4=Y4+Y2*X(I)
      Y8=X(I)-AVE
      SSQ=SSQ+Y8*Y8
      Y3=Y3+Y8*Y8*Y8
      Y5=Y5-1.0
      Y9=ALOG(Y5)-R2
      Y6=Y6+Y9*Y9
  950 Y7=Y7+Y9*Y8

C *** SSQ = SYMMETRIC ESTIMATE OF VARIANCE
C *** NON-NORMAL RETURN IF SSQ = 0.
      W      =1.
      P      =1.
      IF (SSQ.LE.0.) GO TO 990
C *** R2 = R-SQUARED FOR LARGE SAMPLE TEST FOR EXPONENTIALITY
      R2=Y7*Y7/(SSQ*Y6)
      IF(N.GT.50) GO TO 970

C *** B = WEIGHTED SUM OF ORDER STATISTICS
      KMIN   =INDEXA(N)
      KMAX   =INDEXA(N+1)-1
      II     =1
      IJ     =N
      B      =0.
      DO 960 IK=KMIN,KMAX
      B      =B+A(IK)*(X(IJ)-X(II))
      II     =II+1
  960 IJ     =IJ-1
      BSQ    =B*B
C *** W = TEST STATISTIC
      W      =BSQ/SSQ


C *** NON-NORMAL RETURN IF W .LE. EPSILN
      AHOLD  =(W-EPSILN(N))/(1.-W)
      P      =0.
      IF (AHOLD.LE.0.) GO TO 980
C *** P = SIGNIFICANCE LEVEL ASSOCIATED WITH W
      ZA     =DELTA(N)*ALOG(AHOLD)-GAMMA(N)
      ZZ     =ABS(ZA)
      ZZ2    =ZZ**2
      P      =1./(2.*(1.+.196854*ZZ+.115194*ZZ2
     1       +.000344*ZZ*ZZ2+.019527*ZZ2*ZZ2)**4)
      IF (ZA.GT.0.) P =1.-P
      GO TO 980

C *** CALCULATE STATISTICS FOR D TEST OF NORMALITY
  970 D=Y4/SQRT(FLOAT(N*N*N)*SSQ)
      Y=SQRT(FLOAT(N))*(D-.28209479)/.02998598
C *** CALCULATE STATISTICS FOR SKEWNESS TEST
  980 IF(N.LT.8) GO TO 990
      B1=SQRT(FLOAT(N))*Y3/SQRT(SSQ*SSQ*SSQ)
      Y1=FLOAT((N*N+27*N-70)*(N+1)*(N+3))/FLOAT((N-2)*(N+5)*(N+7)*(N+9))
      W2=-1.0+SQRT(6.0*Y1-2.0)
      YA=B1*B1*FLOAT((N+1)*(N+3))*(W2-1.0)/(12.0*FLOAT(N-2))
      R=1.0/SQRT(ALOG(SQRT(W2)))
      Z=R*ALOG(SQRT(YA)+SQRT(YA+1.0))
  990 STDEV=SQRT(SSQ/FLOAT(N-1))
      RETURN
      END


APPENDIX  C 
SAMPLE  RUN 


This appendix provides a sample run of CHECK. The data used in the
sample run are


TEST DATA SET 1
- .9
.12- VALUE 4
156VALUE 5
-1 VALUE 1
2 VALUE 2
-0.3 VALUE 3
0
-.854
1
0./
0.0
-.49

TEST DATA SET                 2


Because the "2" in the title of the second data set is in the thirty-first
column of the data card, it will not be printed in the title in the output.
Note also that, in the thirteenth data field of the first data set, a slash
occurs prior to the blank which indicates the number has been completed.
Because the slash is an inadmissible character, the data field is rejected
(see the sample printed output later in this appendix). The fourth and
eighth data fields in the first data set are also rejected because the "X"
and the "V" are also inadmissible characters.
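
The rejection behavior described above can be illustrated with a short Python sketch. It is not the CHECK input routine (the actual rules are given in subsection 2.2, which is not reproduced here). It assumes that the admissible characters are the digits, the decimal point, the minus sign, and the blank; that a blank following at least one digit completes the number; and that a minus sign anywhere before that blank makes the value negative. The name `parse_field` and the status strings are hypothetical.

```python
ADMISSIBLE = set("0123456789.- ")  # assumed admissible character set


def parse_field(field):
    """Classify one data field as ('ok', value), ('blank', None) or ('rejected', None)."""
    sign = 1.0
    text = ""            # digits and decimal point collected so far
    seen_digit = False
    for ch in field:
        if ch not in ADMISSIBLE:
            return ("rejected", None)      # inadmissible character before completion
        if ch == " ":
            if seen_digit:
                break                      # blank after a digit completes the number
            continue                       # leading or embedded blank before any digit
        if ch == "-":
            sign = -1.0
        else:
            text += ch
            if ch.isdigit():
                seen_digit = True
    if not seen_digit:
        return ("blank", None) if field.strip() == "" else ("rejected", None)
    try:
        return ("ok", sign * float(text))
    except ValueError:                     # e.g. more than one decimal point
        return ("rejected", None)
```

Under these assumptions the field `.12- VALUE 4` is read as -0.12 and `-1 VALUE 1` as -1, while `156VALUE 5` and `0./` are rejected, which agrees with the sample run.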

The output consists of both printed and file output, as discussed in
subsection 3.3. The file output for the sample run is

TEST DATA SET 1                  14       .196      1.873   -1.000    4.383 98.0 E
TEST DATA SET 1                  14       .196      1.873   -1.000    6.000 86.7 R
TEST DATA SET                     9      1.111      1.054    0.000    3.000 80.0 R


The printed output for the sample run completes this appendix.

Note that the difference between the two largest values in the first data
set is large compared, for example, with the standard deviation. When
the analyst examines this printed output, he may wish to delete the
largest value as a possible "outlier", or bad value, and run CHECK again
with the resulting data set.

PROBLEM   1

   TEST DATA SET 1

 CONTENTS OF FIELD:  X - .933
 THE NUMBER IN THIS FIELD IS NOT INCLUDED IN THE DATA SET.

 CONTENTS OF FIELD: 156VALUE 5
 THE NUMBER IN THIS FIELD IS NOT INCLUDED IN THE DATA SET.

 CONTENTS OF FIELD: 0./
 THE NUMBER IN THIS FIELD IS NOT INCLUDED IN THE DATA SET.

 DATA:     -1.000     -1.000      -.900      -.888      -.854      -.700
            -.490      -.300      -.120      0.000      0.000      1.000
            2.000      6.000

     NUMBER OF POINTS:    14
     AVERAGE VALUE (M):       .196286
     STANDARD DEVIATION (S):      1.873355

     SIGNIFICANCE LEVEL OF LARGE SAMPLE TEST FOR EXPONENTIALITY IS  .847145
     THIS RESULT SHOULD NOT BE USED IF THE NUMBER OF DATA POINTS IS LESS THAN
     30.  ITS SIGNIFICANCE SHOULD BE TAKEN WITH A GRAIN OF SALT IF THE NUMBER
     OF DATA POINTS IS LESS THAN 100.  FOR 100 POINTS OR MORE
     EXPONENTIALITY IS DOUBTFUL IF THE SIGNIFICANCE LEVEL IS LESS
     THAN .96 AND PROMISING IF IT EXCEEDS .99.
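
The large sample statistic reported for the first data set can be recomputed from the sorted data. The Python sketch below follows the R2 calculation in subroutine WSTEST as read from the listing: the squared correlation between the ordered values X(i) and ln(N+1-i). The function name `exponentiality_r2` is hypothetical, and the sketch is an illustration rather than a replacement for the FORTRAN routine.

```python
import math


def exponentiality_r2(values):
    """Squared correlation between the ordered data and ln(N+1-i), i = 1..N."""
    x = sorted(values)
    n = len(x)
    mean_x = sum(x) / n
    logs = [math.log(n + 1 - i) for i in range(1, n + 1)]
    mean_log = sum(logs) / n
    sxy = sum((xi - mean_x) * (li - mean_log) for xi, li in zip(x, logs))  # Y7
    sxx = sum((xi - mean_x) ** 2 for xi in x)                              # SSQ
    syy = sum((li - mean_log) ** 2 for li in logs)                         # Y6
    return sxy * sxy / (sxx * syy)


data_set_1 = [-1.0, -1.0, -0.9, -0.888, -0.854, -0.7, -0.49,
              -0.3, -0.12, 0.0, 0.0, 1.0, 2.0, 6.0]
r2 = exponentiality_r2(data_set_1)
```

For the fourteen accepted values of the first data set this evaluates to approximately 0.847.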


     EXPONENTIAL ASSUMPTION PROMISING WITH
     A=       -1.000000     AND MEAN=         .196286
     SIGNIFICENCE LEVEL IS   .903690

   USE       -1.000000     AS THE LOWER BOUND AND
     1.196286K +      -1.000000     AS THE UPPER BOUND
     THE EXPECTED PROPORTION OF THE POPULATION FALLING WITHIN THESE
     BOUNDS DEPENDS UPON K AS INDICATED IN THE FOLLOWING TABLE

      K    PROPORTION OF POPULATION     LOWER BOUND     UPPER BOUND

     1.0             .619360            -.1000000E+01    .1962857E+00
     1.1             .653171            -.1000000E+01    .3159143E+00
     1.2             .683785            -.1000000E+01    .4355429E+00
     1.3             .711522            -.1000000E+01    .5551714E+00
     1.4             .736669            -.1000000E+01    .6748000E+00
     1.5             .759481            -.1000000E+01    .7944286E+00
     1.6             .780190            -.1000000E+01    .9140571E+00
     1.7             .798999            -.1000000E+01    .1033686E+01
     1.8             .816095            -.1000000E+01    .1153314E+01
     1.9             .831642            -.1000000E+01    .1272943E+01
     2.0             .845790            -.1000000E+01    .1392571E+01
     2.1             .858671            -.1000000E+01    .1512200E+01
     2.2             .870407            -.1000000E+01    .1631829E+01
     2.3             .881104            -.1000000E+01    .1751457E+01
     2.4             .890861            -.1000000E+01    .1871086E+01
     2.5             .899765            -.1000000E+01    .1990714E+01
     2.6             .907896            -.1000000E+01    .2110343E+01
     2.7             .915324            -.1000000E+01    .2229971E+01
     2.8             .922113            -.1000000E+01    .2349600E+01
     2.9             .928323            -.1000000E+01    .2469229E+01
     3.0             .934006            -.1000000E+01    .2588857E+01
     3.1             .939208            -.1000000E+01    .2708486E+01
     3.3             .948341            -.1000000E+01    .2947743E+01
     3.5             .956020            -.1000000E+01    .3187000E+01
     3.7             .962488            -.1000000E+01    .3426257E+01
     3.9             .967948            -.1000000E+01    .3665514E+01
     4.2             .974602            -.1000000E+01    .4024400E+01
     4.5             .979798            -.1000000E+01    .4383286E+01
     4.9             .985026            -.1000000E+01    .4861800E+01
     5.5             .990333            -.1000000E+01    .5579571E+01
     6.6             .995516            -.1000000E+01    .6895486E+01
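
The exponential bounds table for the first data set can be regenerated from three numbers: the smallest value, the average, and the number of points. The Python sketch below mirrors the exponential bounds calculator as read from the listing (K stepped from 1.0 in increments of 0.1, a line printed only when the proportion has grown by more than 0.005 since the last printed line). The function name `exponential_bounds` is hypothetical, and the PCIND condition that also governs the TAPE2 line is deliberately left out.

```python
def exponential_bounds(x_min, mean, n, step=0.005):
    """Return rows (K, proportion, lower bound, upper bound) of the printed table."""
    diff = mean - x_min                       # DIFF in the listing
    rows = []
    threshold = 0.0                           # COMP in the listing
    for k in range(10, 201):
        t = 0.1 * k
        pp = 1.0 - (1.0 / (1.0 + t / n)) ** n
        if pp <= threshold:
            continue                          # proportion has not grown enough
        rows.append((round(t, 1), pp, x_min, t * diff + x_min))
        threshold = pp + step
        if threshold >= 1.0:
            break
    return rows


rows = exponential_bounds(-1.0, 0.196286, 14)
# First printed line whose proportion plus 0.005 exceeds 0.9825
# (the TAPE2 cutoff in the listing, ignoring PCIND).
summary = next(r for r in rows if r[1] + 0.005 > 0.9825)
```

With the sample values the sketch produces 31 lines, from K = 1.0 to K = 6.6, and `summary` is the K = 4.5 line whose upper bound of about 4.383 appears in the "E" record of the file output.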
     SIGNIFICENCE LEVEL OF SKEWNESS TEST IS     .000419
     NORMAL ASSUMPTION DOUBTFUL.


     THE SHAPIRO-WILK SIGNIFICANCE LEVEL IS     .000062
     NORMAL ASSUMPTION DOUBTFUL.


   IF NORMAL AND EXPONENTIAL ASSUMPTIONS CANNOT BE MADE OR ARE TOO RISKY,

     USE  -.100000E+01 AS A LOWER BOUND AND   .600000E+01 AS AN
     UPPER BOUND.  EXPECTED PROPORTION OF POPULATION BOUNDED IS  .866667

     USE  -.900000E+00 AS A LOWER BOUND AND   .100000E+01 AS AN
     UPPER BOUND.  EXPECTED PROPORTION OF POPULATION BOUNDED IS  .600000

     USE  -.888000E+00 AS A LOWER BOUND AND  0.           AS AN
     UPPER BOUND.  EXPECTED PROPORTION OF POPULATION BOUNDED IS  .466667

     USE  -.700000E+00 AS A LOWER BOUND AND  -.120000E+00 AS AN
     UPPER BOUND.  EXPECTED PROPORTION OF POPULATION BOUNDED IS  .200000

     USE  -.490000E+00 AS A LOWER BOUND AND  -.300000E+00 AS AN
     UPPER BOUND.  EXPECTED PROPORTION OF POPULATION BOUNDED IS  .066667
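
The five range bounds lines for the first data set follow from the sorted data alone. The Python sketch below mirrors the range bounds calculator as read from the listing: the indices L and J move inward from the two ends, the proportion is (J-L)/(N+1), and a pair is printed only if both bounds have moved strictly inward and the proportion has dropped by at least 0.005. The function name `range_bounds` is hypothetical.

```python
def range_bounds(sorted_x, step=0.005):
    """Return (lower bound, upper bound, proportion) for each printed line."""
    n = len(sorted_x)
    lo, hi = 0, n - 1                     # L and J in the listing (zero-based here)
    last_pp = 2.0                         # COMP starts at 2.0 in the listing
    bl, bu = sorted_x[0] - 1.0, sorted_x[-1] + 1.0
    rows = []
    while lo < hi:
        pp = (hi - lo) / (n + 1.0)
        if last_pp - pp >= step and sorted_x[lo] > bl and sorted_x[hi] < bu:
            bl, bu, last_pp = sorted_x[lo], sorted_x[hi], pp
            rows.append((bl, bu, pp))
        lo += 1
        hi -= 1
    return rows


data_set_1 = [-1.0, -1.0, -0.9, -0.888, -0.854, -0.7, -0.49,
              -0.3, -0.12, 0.0, 0.0, 1.0, 2.0, 6.0]
range_rows = range_bounds(data_set_1)
```

The first row, with bounds -1 and 6 and proportion 13/15, corresponds to the "R" record written to the file output as 86.7 percent.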


PROBLEM   2


   TEST DATA SET


 THE NUMBER OF BLANK FIELDS IS      3

 DATA:      0.000      0.000      0.000      1.000      1.000      1.000
            2.000      2.000      3.000


     NUMBER OF POINTS:     9
     AVERAGE VALUE (M):      1.111111
     STANDARD DEVIATION (S):      1.054093


   EXPONENTIAL ASSUMPTION DOUBTFUL SINCE SIGNIFICENCE LEVEL IS   .010049
     (MDFD ERROR CODE   0)


     SIGNIFICENCE LEVEL OF SKEWNESS TEST IS     .431165
     NORMAL ASSUMPTION QUESTIONABLE.
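
The skewness significance level can be recomputed from the data. The Python sketch below follows the skewness statistics in subroutine WSTEST and the polynomial used for PZ in the main program, as read from the listing; the listing applies this test only when N is at least 8. The function name `skewness_significance` is hypothetical.

```python
import math


def skewness_significance(values):
    """Return (Z, PZ): the skewness test statistic and its two-sided significance level."""
    n = len(values)
    mean = sum(values) / n
    ssq = sum((v - mean) ** 2 for v in values)            # SSQ
    cube = sum((v - mean) ** 3 for v in values)           # Y3
    b1 = math.sqrt(n) * cube / math.sqrt(ssq ** 3)        # sample skewness B1
    y1 = ((n * n + 27 * n - 70) * (n + 1) * (n + 3)
          / ((n - 2) * (n + 5) * (n + 7) * (n + 9)))
    w2 = -1.0 + math.sqrt(6.0 * y1 - 2.0)
    ya = b1 * b1 * (n + 1) * (n + 3) * (w2 - 1.0) / (12.0 * (n - 2))
    r = 1.0 / math.sqrt(math.log(math.sqrt(w2)))
    z = abs(r * math.log(math.sqrt(ya) + math.sqrt(ya + 1.0)))
    # Polynomial approximation to the two-sided normal tail area.
    pz = 1.0 / (1.0 + z * (0.196854 + z * (0.115194
                + z * (0.000344 + z * 0.019527)))) ** 4
    return z, pz


data_set_2 = [0.0, 0.0, 0.0, 1.0, 1.0, 1.0, 2.0, 2.0, 3.0]
z2, pz2 = skewness_significance(data_set_2)
```

For the second data set the sketch gives a significance level of roughly 0.431, which falls between 0.1 and 0.5 and therefore corresponds to the "QUESTIONABLE" message.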


     THE SHAPIRO-WILK SIGNIFICANCE LEVEL IS     .192172
     NORMAL ASSUMPTION QUESTIONABLE.

     USE (M-KS) AS A LOWER BOUND AND (M+KS) AS AN UPPER BOUND
     WHERE THE EXPECTED PROPORTION OF POPULATION FOUND WITHIN THE LIMITS
     IS A FUNCTION OF K AS SPECIFIED IN THE FOLLOWING TABLE

      K    PROPORTION OF POPULATION     LOWER BOUND     UPPER BOUND

      .5             .352066             .5840646E+00    .1638157E+01
      .6             .415164             .4786556E+00    .1743567E+01
      .7             .474687             .3732463E+00    .1848976E+01
      .8             .530347             .2678371E+00    .1954385E+01
      .9             .581962             .1624278E+00    .2059794E+01
     1.0             .629445             .5701856E-01    .2165204E+01
     1.1             .672602            -.4839070E-01    .2270613E+01
     1.2             .712114            -.1538000E+00    .2376022E+01
     1.3             .747526            -.2592092E+00    .2481431E+01
     1.4             .779234            -.3646185E+00    .2586841E+01
     1.5             .807467            -.4700277E+00    .2692250E+01
     1.6             .832460            -.5754370E+00    .2797659E+01
     1.7             .854539            -.6808462E+00    .2903068E+01
     1.8             .873912            -.7862555E+00    .3008478E+01
     1.9             .890865            -.8916647E+00    .3113887E+01
     2.0             .905650            -.9970740E+00    .3219296E+01
     2.1             .918510            -.1102483E+01    .3324705E+01
     2.2             .929666            -.1207893E+01    .3430115E+01
     2.3             .939324            -.1313302E+01    .3535524E+01
     2.4             .947671            -.1418711E+01    .3640933E+01
     2.5             .954874            -.1524120E+01    .3746342E+01
     2.6             .961082            -.1629530E+01    .3851752E+01
     2.7             .966429            -.1734939E+01    .3957161E+01
     2.9             .974987            -.1945757E+01    .4167980E+01
     3.1             .981317            -.2156576E+01    .4378798E+01
     3.4             .987862            -.2472804E+01    .4695026E+01
     3.8             .993068            -.2894441E+01    .5116663E+01
     4.8             .998135            -.3948533E+01    .6170755E+01
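
The normal bounds table can be checked without the MDFD library routine. In the listing, the proportion is the F distribution probability with 1 and N-1 degrees of freedom evaluated at K*K*N/(N+1); this equals the probability that a Student's t variable with N-1 degrees of freedom lies within plus or minus K*sqrt(N/(N+1)). The Python sketch below uses that equivalence and integrates the t density numerically with Simpson's rule, which is an assumption of this sketch and not the method used by MDFD. The names `t_two_sided` and `normal_bounds` are hypothetical.

```python
import math


def t_two_sided(x, dof, panels=2000):
    """P(|T| <= x) for Student's t with dof degrees of freedom (Simpson's rule)."""
    log_c = math.lgamma((dof + 1) / 2.0) - math.lgamma(dof / 2.0)
    c = math.exp(log_c) / math.sqrt(dof * math.pi)

    def density(t):
        return c * (1.0 + t * t / dof) ** (-(dof + 1) / 2.0)

    h = x / panels                        # panels must be even for Simpson's rule
    total = density(0.0) + density(x)
    for i in range(1, panels):
        total += (4 if i % 2 else 2) * density(i * h)
    return 2.0 * total * h / 3.0          # doubled because the density is symmetric


def normal_bounds(mean, stdev, n, k):
    """Return (lower bound, upper bound, expected proportion) for multiplier k."""
    proportion = t_two_sided(k * math.sqrt(n / (n + 1.0)), n - 1)
    return mean - k * stdev, mean + k * stdev, proportion


lower, upper, proportion = normal_bounds(1.111111, 1.054093, 9, 1.0)
```

For the second data set and K = 1.0 the sketch gives bounds of about 0.057 and 2.165 with a proportion of about 0.6294, matching the corresponding line of the table.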

   IF NORMAL AND EXPONENTIAL ASSUMPTIONS CANNOT BE MADE OR ARE TOO RISKY,

     USE  0.           AS A LOWER BOUND AND   .300000E+01 AS AN
     UPPER BOUND.  EXPECTED PROPORTION OF POPULATION BOUNDED IS  .800000
APPENDIX D

CROSS-REFERENCE TABLE OF VARIABLES AND PROGRAM USAGE


This appendix provides a listing of the variable names and indicates
the use of each variable in CHECK.


A

Table of coefficients for the Shapiro-Wilk test for
normality.

ADDON

Used in calculation of significance level for normal
bounds (see Equation (26)); ADDON = N/(N+1).

AHOLD

Intermediate value used in calculation of the significance
level of the Shapiro-Wilk test statistic.
AHOLD = (W - ε_n)/(1 - W) of Equation (22).

AVE

Average of the data values.

B

Weighted sum of the order statistics used in calculating
the Shapiro-Wilk test statistic (see BSQ).

BL

Lower bound for the printed lines in the three bounds
calculators (see subsections 2.4, 2.6, and 2.7).

BSQ

Square of the weighted sum of the order statistics
used in calculating the Shapiro-Wilk test statistic.
Numerator of Equation (19).

BU

Upper bound for the printed lines in the three bounds
calculators (see subsections 2.4, 2.6, and 2.7).

BUILD

Used to hold the current number as it is being built
(see subsection 2.2).

B1

Sample skewness (see Equation (12)).

C

Table of admissible characters (see subsection 2.2).

CCOL

Contents of current column; CCOL = FLD(NCOL).

COMP

Used to control printed output in the bounds calculators.
A line will be printed only if PP (see PP)
has increased by at least 0.005 since the last printed
line.

D

D test statistic (see Equation (21)).

DELTA

Table of values, one of which is used in each Shapiro-Wilk
test to calculate the significance level associated
with the test statistic (see W). DELTA(n) is the δ_n of
Equation (22).

DIFF

Difference between the average of the data values
(see AVE) and the smallest of the data values (see X).

EPSILN

Table of values, one of which is used in each
Shapiro-Wilk test as an initial indication of
non-normality of the data. EPSILN(n) is ε_n of
Equation (21).

F

Uniformity test statistic; f of Equation (3).

FLD

Array containing the current data field.

FN

Minimum function of N to aid in determining the
value of PCIND. If R2 is less than FN, an exponential
line will not be included on TAPE2 for that
data set. See Equation (10).

FP

Used to control printed output; F(1-p) of Equation
(26).

FRACT

Fractional part indicator. FRACT = 1 before the first
decimal point is encountered. FRACT = 0.1 afterward.

GAMMA

Table of values, one of which is used in each Shapiro-Wilk
test to calculate the significance level associated
with the test statistic (see W).

INDEXA

Index of the first value in A (see A) to be used in
calculation of W (see W).

KMIN

Index of the first value in A (see A) used in the
weighted sum B; KMIN = INDEXA(N).

L

Index of lower bound in range bounds.

M

Largest integer no larger than one-half of N. If N
is even, M = N/2. If N is odd, M = (N-1)/2.

N

Number of data points.

NBFLD

Number of all-blank data fields in the current data
set.

NBLANK

Number of blank columns in the current data field.

NCOL

Number of the column under consideration.

NEND

Number of decimal points and minus signs since the
last digit in the current field. NEND = 10 indicates
the end of the data set.

(variable name not legible)

Problem (data set) number.

NX

Dimension of X. If the dimension of X is changed, the
value of NX must be changed appropriately.

P

Significance level of the Shapiro-Wilk test statistic.

PCIND

Variable used to control selection of summary information
for inclusion on TAPE2 (see paragraphs 2.3.3 and 2.5.2).

PF

Significance level for the uniformity test (see
paragraph 2.3.2).

PP

Expected probability that another draw from the same
distribution will lie within the suggested bounds (see
subsections 2.4, 2.6, and 2.7).

PZ

Significance level for the skewness test statistic
(see paragraph 2.5.1).

P05

Five percentile for the transformed D statistic (see
paragraph 2.5.3).

P10

Ten percentile for the transformed D statistic (see
paragraph 2.5.3).

P90

Ninety percentile for the transformed D statistic
(see paragraph 2.5.3).

P95

Ninety-five percentile for the transformed D statistic
(see paragraph 2.5.3).

Q

Number of data points; Q = N.

R

E of Equations (14) and (15).

R2

Significance level (R^2) of the large sample test for
exponentiality (see Equation (1)).

S

Sum of the data values.

SGN

Indicator variable used in calculation of PCIND.
SGN = 1 if N is less than 75. Otherwise, SGN = -1.

SIGN

Sign of the current data value (plus or minus one).

SSQ

Sum of the squared deviations from their mean of the
data values.

STDEV

Standard deviation of the data values.

T

Used to control printed output. T = t of Equation (8).

TITL

Data set title.

W

Shapiro-Wilk test statistic (see Equation (19)).

WHOLE

Integer part multiplier. WHOLE = 10 before the first
decimal point is encountered for each field.
WHOLE = 1 afterward.

W2

Square of W of Equation (13).

X

Array used to store the values in the current data set.
In subroutine WSTEST, the values in X are arranged in
ascending order so that X(1) is the smallest of the
values. (See NX.)

X1

Numerator of the test statistic of the uniformity test
for exponentiality (see Equation (3)).

X2

Denominator of the test statistic of the uniformity
test for exponentiality (see Equation (3)).

Y

Transformed D statistic (see Equation (24)).

YA

(y/a) of Equation (14).

YTEST

Dummy variable used in sorting the values into ascending
order.

Y1

Dummy variable used in calculation of W2 (see W2).

Y2

i - (N+1)/2 used in calculation of D (see Equation (23)).

Y3

Sum of (X(i) - AVE)^3 used in calculation of B1
(see Equation (11)).

Y4

Sum of [i - (N+1)/2]X(i) used in calculation of D (see
Equation (23)).

Y5

(N+1-j) used in calculation of R2 (see Equation (1)).

Y6

Sum of Y9^2 used in calculation of R2 (see Y9).

Y7

Sum of Y8 x Y9 used in calculation of R2 (see Y8 and Y9).

Y8

X(i) - AVE used in calculation of SSQ, B1, and R2
(see SSQ and Equations (1) and (12)).

Y9

Difference between ln(N+1-i) and the average of the
ln(N+1-j)'s, used in calculation of R2 (see Equation
(12)).

Z

Skewness test statistic (see Equation (14)).